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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: Svendsen, Allan 

Bisgard- Frant zen , Henr ik 
Borchert, Torben Vedel 

(ii) TITLE OF INVENTION: Qf-Amylase Mutants 

(iii) NUMBER OF SEQUENCES: 13 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Novo Nordisk of North America, Inc. 

(B) STREET: 405 Lexington Avenue, 64th Floor 

(C) CITY: New York 

(D) STATE: New York 

(E) COUNTRY: United States of America 

(F) ZIP: 10174-6401 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.30 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 03-JUN-1999 

(C) CLASSIFICATION: 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME : Green, Reza 

(B) REGISTRATION NUMBER: 38,475 

(C) REFERENCE/DOCKET NUMBER: 4394.214-US 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 212-867-0123 

(B) TELEFAX: 212-878-9655 



(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1920 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

( ix) FEATURE : 

(A) NAME/KEY: CDS 

(B) LOCATION: 334.. 1869 

(ix) FEATURE: 

(A) NAME / KEY : sig_peptide 

(B) LOCATION: 334.. 420 

(ix) FEATURE: 

(A) NAME /KEY : mat_peptide 

(B) LOCATION: 421.. 1869 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 
CGGAAGATTG GAAGTACAAA AATAAGCAAA AGATTGTCAA TCATGTCATG AGCCATGCGG 60 
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GAGACGGAAA 


AATCGTCTTA 


ATGCACGATA 


TTTATGCAAC 


GTTCGCAGAT 


GCTGCTGAAG 


12 0 


AGATTATTAA 


AAAGCTGAAA 


GCAAAAGGCT 


ATCAATTGGT 


AACTGTATCT 


CAGCTTGAAG 


180 


AAGTGAAGAA 


GCAGAGAGGC 


TATTGAATAA 


ATGAGTAGAA 


GCGCCATATC 


GGCGCTTTTC 


240 


TTTTGGAAGA 


AAATATAGGG 


AAAATGGTAC 


TTGTTAAAAA 


TTCGGAATAT 


TTATACAACA 


300 


TCATATGTTT 


CACATTGAAA 


GGGGAGGAGA 


ATC ATG AAA CAA CAA AAA CGG CTT 


354 



Met Lys Gin Gin Lys Arg Leu 
-29 -25 

TAC GCC CGA TTG CTG ACG CTG TTA TTT GCG CTC ATC TTC TTG CTG CCT 4 02 

Tyr Ala Arg Leu Leu Thr Leu Leu Phe Ala Leu lie Phe Leu Leu Pro 
-20 -15 -10 

CAT TCT GCA GCA GCG GCG GCA AAT CTT AAT GGG ACG CTG ATG CAG TAT 4 50 

His Ser Ala Ala Ala Ala Ala Asn Leu Asn Gly Thr Leu Met Gin Tyr 
-5 15 10 

TTT GAA TGG TAC ATG CCC AAT GAC GGC CAA CAT TGG AGG CGT TTG CAA 4 98 

Phe Glu Trp Tyr Met Pro Asn Asp Gly Gin His Trp Arg Arg Leu Gin 
15 20 25 

AAC GAC TCG GCA TAT TTG GCT GAA CAC GGT ATT ACT GCC GTC TGG ATT 546 
Asn Asp Ser Ala Tyr Leu Ala Glu His Gly lie Thr Ala Val Trp lie 
30 35 40 

CCC CCG GCA TAT AAG GGA ACG AGC CAA GCG GAT GTG GGC TAC GGT GCT 594 
Pro Pro Ala Tyr Lys Gly Thr Ser Gin Ala Asp Val Gly Tyr Gly Ala 
45 50 55 

TAC GAC CTT TAT GAT TTA GGG GAG TTT CAT CAA AAA GGG ACG GTT CGG 642 
Tyr Asp Leu Tyr Asp Leu Gly Glu Phe His Gin Lys Gly Thr Val Arg 
60 65 70 

ACA AAG TAC GGC ACA AAA GGA GAG CTG CAA TCT GCG ATC AAA AGT CTT 690 
Thr Lys Tyr Gly Thr Lys Gly Glu Leu Gin Ser Ala lie Lys Ser Leu 
75 80 85 90 

CAT TCC CGC GAC ATT AAC GTT TAC GGG GAT GTG GTC ATC AAC CAC AAA 73 8 

His Ser Arg Asp lie Asn Val Tyr Gly Asp Val Val lie Asn His Lys 
95 100 105 

GGC GGC GCT GAT GCG ACC GAA GAT GTA ACC GCG GTT GAA GTC GAT CCC 78 6 

Gly Gly Ala Asp Ala Thr Glu Asp Val Thr Ala Val Glu Val Asp Pro 
110 115 120 

GCT GAC CGC AAC CGC GTA ATT TCA GGA GAA CAC CTA ATT AAA GCC TGG 834 
Ala Asp Arg Asn Arg Val lie Ser Gly Glu His Leu lie Lys Ala Trp 
125 130 135 

ACA CAT TTT CAT TTT CCG GGG CGC GGC AGC ACA TAC AGC GAT TTT AAA 8 82 

Thr His Phe His Phe Pro Gly Arg Gly Ser Thr Tyr Ser Asp Phe Lys 
140 145 150 

TGG CAT TGG TAC CAT TTT GAC GGA ACC GAT TGG GAC GAG TCC CGA AAG 930 
Trp His Trp Tyr His Phe Asp Gly Thr Asp Trp Asp Glu Ser Arg Lys 
155 160 165 170 

CTG AAC CGC ATC TAT AAG TTT CAA GGA AAG GCT TGG GAT TGG GAA GTT 978 
Leu Asn Arg lie Tyr Lys Phe Gin Gly Lys Ala Trp Asp Trp Glu Val 
175 180 185 

TCC AAT GAA AAC GGC AAC TAT GAT TAT TTG ATG TAT GCC GAC ATC GAT 1026 
Ser Asn Glu Asn Gly Asn Tyr Asp Tyr Leu Met Tyr Ala Asp lie Asp 
190 195 200 




TAT GAC CAT CCT GAT GTC GCA GCA GAA ATT AAG AGA TGG GGC ACT TGG 1074 

Tyr Asp His Pro Asp Val Ala Ala Glu lie Lys Arg Trp Gly Thr Trp 
205 210 215 

TAT GCC AAT GAA CTG CAA TTG GAC GGT TTC CGT CTT GAT GCT GTC AAA 1122 

Tyr Ala Asn Glu Leu Gin Leu Asp Gly Phe Arg Leu Asp Ala Val Lys 
220 225 230 

CAC ATT AAA TTT TCT TTT TTG CGG GAT TGG GTT AAT CAT GTC AGG GAA 117 0 

His lie Lys Phe Ser Phe Leu Arg Asp Trp Val Asn His Val Arg Glu 

235 240 245 250 

AAA ACG GGG AAG GAA ATG TTT ACG GTA GCT GAA TAT TGG CAG AAT GAC 1218 

Lys Thr Gly Lys Glu Met Phe Thr Val Ala Glu Tyr Trp Gin Asn Asp . 

255 260 265 

TTG GGC GCG CTG GAA AAC TAT TTG AAC AAA ACA AAT TTT AAT CAT TCA 12 66 

Leu Gly Ala Leu Glu Asn Tyr Leu Asn Lys Thr Asn Phe Asn His Ser 
270 275 280 

GTG TTT GAC GTG CCG CTT CAT TAT CAG TTC CAT GCT GCA TCG ACA CAG 1314 

Val Phe Asp Val Pro Leu His Tyr Gin Phe His Ala Ala Ser Thr Gin 
285 290 295 

GGA GGC GGC TAT GAT ATG AGG AAA TTG CTG AAC GGT ACG GTC GTT TCC 13 62 

Gly Gly Gly Tyr Asp Met Arg Lys Leu Leu Asn Gly Thr Val Val Ser 
300 305 310 

AAG CAT CCG TTG AAA TCG GTT ACA TTT GTC GAT AAC CAT GAT ACA CAG 1410 

Lys His Pro Leu Lys Ser Val Thr Phe Val Asp Asn His Asp Thr Gin 

315 320 325 330 

CCG GGG CAA TCG CTT GAG TCG ACT GTC CAA ACA TGG TTT AAG CCG CTT 14 58 

Pro Gly Gin Ser Leu Glu Ser Thr Val Gin Thr Trp Phe Lys Pro Leu 
335 340 345 

GCT TAC GCT TTT ATT CTC ACA AGG GAA TCT GGA TAC CCT CAG GTT TTC 15 06 

Ala Tyr Ala Phe lie Leu Thr Arg Glu Ser Gly Tyr Pro Gin Val Phe 
350 355 360 

TAC GGG GAT ATG TAC GGG ACG AAA GGA GAC TCC CAG CGC GAA ATT CCT 15 54 

Tyr Gly Asp Met Tyr Gly Thr Lys Gly Asp Ser Gin Arg Glu lie Pro 
365 370 375 

GCC TTG AAA CAC AAA ATT GAA CCG ATC TTA AAA GCG AGA AAA CAG TAT 1602 

Ala Leu Lys His Lys lie Glu Pro lie Leu Lys Ala Arg Lys Gin Tyr 
380 385 390 

GCG TAC GGA GCA CAG CAT GAT TAT TTC GAC CAC CAT GAC ATT GTC GGC 16 5 0 

Ala Tyr Gly Ala Gin His Asp Tyr Phe Asp His His Asp lie Val Gly 

395 400 405 410 

TGG ACA AGG GAA GGC GAC AGC TCG GTT GCA AAT TCA GGT TTG GCG GCA 1698 

Trp Thr Arg Glu Gly Asp Ser Ser Val Ala Asn Ser Gly Leu Ala Ala 
415 420 425 

TTA ATA ACA GAC GGA CCC GGT GGG GCA AAG CGA ATG TAT GTC GGC CGG 1746 

Leu lie Thr Asp Gly Pro Gly Gly Ala Lys Arg Met Tyr Val Gly Arg 
430 435 440 

CAA AAC GCC GGT GAG ACA TGG CAT GAC ATT ACC GGA AAC CGT TCG GAG 1794 

Gin Asn Ala Gly Glu Thr Trp His Asp lie Thr Gly Asn Arg Ser Glu 
445 450 455 

CCG GTT GTC ATC AAT TCG GAA GGC TGG GGA GAG TTT CAC GTA AAC GGC 1842 

Pro Val Val lie Asn Ser Glu Gly Trp Gly Glu Phe His Val Asn Gly 
460 465 470 
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GGG TCG GTT TCA ATT TAT GTT CAA AGA TAGAAGAGCA GAGAGGACGG 
Gly Ser Val Ser lie Tyr Val Gin Arg 
475 480 



ATTTCCTGAA GGAAATCCGT TTTTTTATTT T 



1889 
1920 



(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 512 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

Met Lys Gin Gin Lys Arg Leu Tyr Ala Arg Leu Leu Thr Leu Leu Phe 
-29 -25 -20 -15 

Ala Leu lie Phe Leu Leu Pro His Ser Ala Ala Ala Ala Ala Asn Leu 
-10 -5 1 

? ~~ Asn Gly Thr Leu Met Gin Tyr Phe Glu Trp Tyr Met Pro Asn Asp Gly 

yJ 5 10 15 

fl\ Gin His Trp Arg Arg Leu Gin Asn Asp Ser Ala Tyr Leu Ala Glu His 

jjSj 20 25 30 35 

^ Gly lie Thr Ala Val Trp lie Pro Pro Ala Tyr Lys Gly Thr Ser Gin 

£□ 40 45 50 

2" Ala Asp Val Gly Tyr Gly Ala Tyr Asp Leu Tyr Asp Leu Gly Glu Phe 

JL 55 60 65 

CP His Gin Lys Gly Thr Val Arg Thr Lys Tyr Gly Thr Lys Gly Glu Leu 

Q 70 75 80 

^ Gin Ser Ala lie Lys Ser Leu His Ser Arg Asp lie Asn Val Tyr Gly 

^ 85 90 95 

Asp Val Val lie Asn His Lys Gly Gly Ala Asp Ala Thr Glu Asp Val 
100 105 110 115 

Thr Ala Val Glu Val Asp Pro Ala Asp Arg Asn Arg Val lie Ser Gly 
120 125 130 

Glu His Leu lie Lys Ala Trp Thr His Phe His Phe Pro Gly Arg Gly 
135 140 145 

Ser Thr Tyr Ser Asp Phe Lys Trp His Trp Tyr His Phe Asp Gly Thr 
150 155 160 

Asp Trp Asp Glu Ser Arg Lys Leu Asn Arg lie Tyr Lys Phe Gin Gly 
165 170 175 

Lys Ala Trp Asp Trp Glu Val Ser Asn Glu Asn Gly Asn Tyr Asp Tyr 
180 185 190 195 

Leu Met Tyr Ala Asp lie Asp Tyr Asp His Pro Asp Val Ala Ala Glu 
200 205 210 

lie Lys Arg Trp Gly Thr Trp Tyr Ala Asn Glu Leu Gin Leu Asp Gly 
215 220 225 
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Phe Arg Leu Asp Ala Val Lys His lie Lys Phe Ser Phe Leu Arg Asp 
230 235 240 

Trp Val Asn His Val Arg Glu Lys Thr Gly Lys Glu Met Phe Thr Val 
245 250 255 

Ala Glu Tyr Trp Gin Asn Asp Leu Gly Ala Leu Glu Asn Tyr Leu Asn 
260 265 270 275 

Lys Thr Asn Phe Asn His Ser Val Phe Asp Val Pro Leu His Tyr Gin 
280 285 290 

Phe His Ala Ala Ser Thr Gin Gly Gly Gly Tyr Asp Met Arg Lys Leu 
295 300 305 

Leu Asn Gly Thr Val Val Ser Lys His Pro Leu Lys Ser Val Thr Phe 
310 315 320 

Val Asp Asn His Asp Thr Gin Pro Gly Gin Ser Leu Glu Ser Thr Val 
325 330 335 

Gin Thr Trp Phe Lys Pro Leu Ala Tyr Ala Phe lie Leu Thr Arg Glu 
340 345 350 355 

Ser Gly Tyr Pro Gin Val Phe Tyr Gly Asp Met Tyr Gly Thr Lys Gly 
360 365 370 

Asp Ser Gin Arg Glu lie Pro Ala Leu Lys His Lys lie Glu Pro lie 
375 380 385 

Leu Lys Ala Arg Lys Gin Tyr Ala Tyr Gly Ala Gin His Asp Tyr Phe 
390 395 400 

Asp His His Asp lie Val Gly Trp Thr Arg Glu Gly Asp Ser Ser Val 
405 410 415 

Ala Asn Ser Gly Leu Ala Ala Leu lie Thr Asp Gly Pro Gly Gly Ala 
420 425 430 435 

Lys Arg Met Tyr Val Gly Arg Gin Asn Ala Gly Glu Thr Trp His Asp 
440 445 450 

lie Thr Gly Asn Arg Ser Glu Pro Val Val lie Asn Ser Glu Gly Trp 
455 460 465 

Gly Glu Phe His Val Asn Gly Gly Ser Val Ser lie Tyr Val Gin Arg 
470 475 480 

(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 08i4 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 250.. 1791 

(ix) FEATURE: 

(A) NAME/KEY: sigj>eptide 

(B) LOCATION: 250.. 342 




(ix) FEATURE: 

(A) NAME / KEY : mat_peptide 

(B) LOCATION: 343 .. 1791 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

GCCCCGCACA TACGAAAAGA CTGGCTGAAA ACATTGAGCC TTTGATGACT GATGATTTGG 60 

CTGAAGAAGT GGATCGATTG TTTGAGAAAA GAAGAAG AC C ATAAAAATAC CTTGTCTGTC 12 0 

ATCAGACAGG GTATTTTTTA TGCTGTCCAG ACTGTCCGCT GTGTAAAAAT AAGGAATAAA 18 0 

GGGGGGTTGT TATTATTTTA CTGATATGTA AAATATAATT TGTATAAGAA AATGAGAGGG 24 0 

AGAGGAAAC ATG ATT CAA AAA CGA AAG CGG ACA GTT TCG TTC AGA CTT 2 88 
Met lie Gin Lys Arg Lys Arg Thr Val Ser Phe Arg Leu 
-31 -30 -25 -20 

GTG CTT ATG TGC ACG CTG TTA TTT GTC AGT TTG CCG ATT ACA AAA ACA 3 36 
Val Leu Met Cys Thr Leu Leu Phe Val Ser Leu Pro lie Thr Lys Thr 
-15 -10 -5 

TCA GCC GTA AAT GGC ACG CTG ATG CAG TAT TTT GAA TGG TAT ACG CCG 3 84 
Ser Ala Val Asn Gly Thr Leu Met Gin Tyr Phe Glu Trp Tyr Thr Pro 
"5 15 10 

W AAC GAC GGC CAG CAT TGG AAA CGA TTG CAG AAT GAT GCG GAA CAT TTA 43 2 
?y Asn Asp Gly Gin His Trp Lys Arg Leu Gin Asn Asp Ala Glu His Leu 
15 20 25 30 



in 



TCG GAT ATC GGA ATC ACT GCC GTC TGG ATT CCT CCC GCA TAC AAA GGA 480 

Ser Asp lie Gly lie Thr Ala Val Trp lie Pro Pro Ala Tyr Lys Gly 
35 40 45 

TTG AGC CAA TCC GAT AAC GGA TAC GGA CCT TAT GAT TTG TAT GAT TTA 52 8 

Leu Ser Gin Ser Asp Asn Gly Tyr Gly Pro Tyr Asp Leu Tyr Asp Leu 
50 55 60 

GGA GAA TTC CAG CAA AAA GGG ACG GTC AGA ACG AAA TAC GGC ACA AAA 5 76 

Gly Glu Phe Gin Gin Lys Gly Thr Val Arg Thr Lys Tyr Gly Thr Lys 
65 70 75 

TCA GAG CTT CAA GAT GCG ATC GGC TCA CTG CAT TCC CGG AAC GTC CAA 624 

Ser Glu Leu Gin Asp Ala lie Gly Ser Leu His Ser Arg Asn Val Gin 

80 85 90 

GTA TAC GGA GAT GTG GTT TTG AAT CAT AAG GCT GGT GCT GAT GCA ACA 6 72 

Val Tyr Gly Asp Val Val Leu Asn His Lys Ala Gly Ala Asp Ala Thr 

95 100 105 110 

GAA GAT GTA ACT GCC GTC GAA GTC AAT CCG GCC AAT AGA AAT CAG GAA 72 0 

Glu Asp Val Thr Ala Val Glu Val Asn Pro Ala Asn Arg Asn Gin Glu 
115 120 125 

ACT TCG GAG GAA TAT CAA ATC AAA GCG TGG ACG GAT TTT CGT TTT CCG 768 

Thr Ser Glu Glu Tyr Gin lie Lys Ala Trp Thr Asp Phe Arg Phe Pro 
130 135 140 

GGC CGT GGA AAC ACG TAC AGT GAT TTT AAA TGG CAT TGG TAT CAT TTC 816 

Gly Arg Gly Asn Thr Tyr Ser Asp Phe Lys Trp His Trp Tyr His Phe 
145 150 155 

GAC GGA GCG GAC TGG GAT GAA TCC CGG AAG ATC AGC CGC ATC TTT AAG 864 

Asp Gly Ala Asp Trp Asp Glu Ser Arg Lys lie Ser Arg lie Phe Lys 

160 165 170 

TTT CGT GGG GAA GGA AAA GCG TGG GAT TGG GAA GTA TCA AGT GAA AAC 912 
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Phe Arg Gly Glu Gly Lys Ala Trp Asp Trp Glu Val Ser Ser Glu Asn 
175 180 185 190 

GGC AAC TAT GAC TAT TTA ATG TAT GCT GAT GTT GAC TAC GAC CAC CCT 960 
Gly Asn Tyr Asp Tyr Leu Met Tyr Ala Asp Val Asp Tyr Asp His Pro 
195 200 205 

GAT GTC GTG GCA GAG ACA AAA AAA TGG GGT ATC TGG TAT GCG AAT GAA 10 08 

Asp Val Val Ala Glu Thr Lys Lys Trp Gly lie Trp Tyr Ala Asn Glu 
210 215 220 

CTG TCA TTA GAC GGC TTC CGT ATT GAT GCC GCC AAA CAT ATT AAA TTT 10 56 

Leu Ser Leu Asp Gly Phe Arg lie Asp Ala Ala Lys His lie Lys Phe 
225 230 235 

TCA TTT CTG CGT GAT TGG GTT CAG GCG GTC AGA CAG GCG ACG GGA AAA 1104 
Ser Phe Leu Arg Asp Trp Val Gin Ala Val Arg Gin Ala Thr Gly Lys 
240 245 250 

GAA ATG TTT ACG GTT GCG GAG TAT TGG CAG AAT AAT GCC GGG AAA CTC 1152 
Glu Met Phe Thr Val Ala Glu Tyr Trp Gin Asn Asn Ala Gly Lys Leu 
255 260 265 270 

GAA AAC TAC TTG AAT AAA ACA AGC TTT AAT CAA TCC GTG TTT GAT GTT 12 00 

Glu Asn Tyr Leu Asn Lys Thr Ser Phe Asn Gin Ser Val Phe Asp Val 
275 280 285 

CCG CTT CAT TTC AAT TTA CAG GCG GCT TCC TCA CAA GGA GGC GGA TAT 124 8 

Pro Leu His Phe Asn Leu Gin Ala Ala Ser Ser Gin Gly Gly Gly Tyr 
290 295 300 

GAT ATG AGG CGT TTG CTG GAC GGT ACC GTT GTG TCC AGG CAT CCG GAA 12 96 

Asp Met Arg Arg Leu Leu Asp Gly Thr Val Val Ser Arg His Pro Glu 
305 310 315 

AAG GCG GTT ACA TTT GTT GAA AAT CAT GAC ACA CAG CCG GGA CAG TCA 13 44 

Lys Ala Val Thr Phe Val Glu Asn His Asp Thr Gin Pro Gly Gin Ser 
320 325 330 

TTG GAA TCG ACA GTC CAA ACT TGG TTT AAA CCG CTT GCA TAC GCC TTT 13 92 

Leu Glu Ser Thr Val Gin Thr Trp Phe Lys Pro Leu Ala Tyr Ala Phe 
335 340 345 350 

ATT TTG ACA AGA GAA TCC GGT TAT CCT CAG GTG TTC TAT GGG GAT ATG 144 0 

lie Leu Thr Arg Glu Ser Gly Tyr Pro Gin Val Phe Tyr Gly Asp Met 
355 360 365 

TAC GGG ACA AAA GGG ACA TCG CCA AAG GAA ATT CCC TCA CTG AAA GAT 14 88 

Tyr Gly Thr Lys Gly Thr Ser Pro Lys Glu lie Pro Ser Leu Lys Asp 
370 375 380 

AAT ATA GAG CCG ATT TTA AAA GCG CGT AAG GAG TAC GCA TAC GGG CCC 153 6 

Asn lie Glu Pro lie Leu Lys Ala Arg Lys Glu Tyr Ala Tyr Gly Pro 
385 390 395 

CAG CAC GAT TAT ATT GAC CAC CCG GAT GTG ATC GGA TGG ACG AGG GAA 1584 
Gin His Asp Tyr lie Asp His Pro Asp Val lie Gly Trp Thr Arg Glu 
400 405 410 

GGT GAC AGC TCC GCC GCC AAA TCA GGT TTG GCC GCT TTA ATC ACG GAC 16 32 

Gly Asp Ser Ser Ala Ala Lys Ser Gly Leu Ala Ala Leu lie Thr Asp 
415 420 425 430 

GGA CCC GGC GGA TCA AAG CGG ATG TAT GCC GGC CTG AAA AAT GCC GGC 16 8 0 

Gly Pro Gly Gly Ser Lys Arg Met Tyr Ala Gly Leu Lys Asn Ala Gly 
435 440 445 
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GAG ACA TGG TAT GAC ATA ACG GGC AAC CGT TCA GAT ACT GTA AAA ATC 172 8 

Glu Thr Trp Tyr Asp lie Thr Gly Asn Arg Ser Asp Thr Val Lys lie 
450 455 460 

GGA TCT GAC GGC TGG GGA GAG TTT CAT GTA AAC GAT GGG TCC GTC TCC 17 76 

Gly Ser Asp Gly Trp Gly Glu Phe His Val Asn Asp Gly Ser Val Ser 
465 470 475 

ATT TAT GTT CAG AAA TAAGGTAATA AAAAAACACC TCCAAGCTGA GTGCGGGTAT 1831 
lie Tyr Val Gin Lys 
480 

CAGCTTGGAG GTGCGTTTAT TTTTTCAGCC GTATGACAAG GTCGGCATCA GGTGTGACAA 1891 

ATACGGTATG CTGGCTGTCA TAGGTGACAA ATCCGGGTTT TGCGCCGTTT GGCTTTTTCA 1951 

CATGTC TGAT TTTTGTATAA TCAACAGGCA CGGAGCCGGA ATCTTTCGCC TTGGAAAAAT 2 011 

AAGCGGCGAT CGTAGCTGCT TCCAATATGG ATTGTTCATC GGGATCGCTG CTTTTAATCA 2 071 

CAACGTGGGA TCC 2 084 

(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 514 amino acids 
<B) TYPE : amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 

Met lie Gin Lys Arg Lys Arg Thr Val Ser Phe Arg Leu Val Leu Met 

-31 -30 -25 -20 

Cys Thr Leu Leu Phe Val Ser Leu Pro lie Thr Lys Thr Ser Ala Val 

-15 -10 -5 1 

Asn Gly Thr Leu Met Gin Tyr Phe Glu Trp Tyr Thr Pro Asn Asp Gly 
5 10 15 

Gin His Trp Lys Arg Leu Gin Asn Asp Ala Glu His Leu Ser Asp lie 
20 25 30 

Gly lie Thr Ala Val Trp lie Pro Pro Ala Tyr Lys Gly Leu Ser Gin 
35 40 45 

Ser Asp Asn Gly Tyr Gly Pro Tyr Asp Leu Tyr Asp Leu Gly Glu Phe 
50 55 60 65 

Gin Gin Lys Gly Thr Val Arg Thr Lys Tyr Gly Thr Lys Ser Glu Leu 
70 75 80 

Gin Asp Ala lie Gly Ser Leu His Ser Arg Asn Val Gin Val Tyr Gly 
85 90 95 

Asp Val Val Leu Asn His Lys Ala Gly Ala Asp Ala Thr Glu Asp Val 
100 105 110 

Thr Ala Val Glu Val Asn Pro Ala Asn Arg Asn Gin Glu Thr Ser Glu 
115 120 125 

Glu Tyr Gin lie Lys Ala Trp Thr Asp Phe Arg Phe Pro Gly Arg Gly 
130 135 140 145 




Asn Thr Tyr Ser Asp Phe Lys Trp His Trp Tyr His Phe Asp Gly Ala 
150 155 160 

Asp Trp Asp Glu Ser Arg Lys lie Ser Arg lie Phe Lys Phe Arg Gly 
165 170 175 

Glu Gly Lys Ala Trp Asp Trp Glu Val Ser Ser Glu Asn Gly Asn Tyr 
180 185 190 

Asp Tyr Leu Met Tyr Ala Asp Val Asp Tyr Asp His Pro Asp Val Val 
195 200 205 

Ala Glu Thr Lys Lys Trp Gly lie Trp Tyr Ala Asn Glu Leu Ser Leu 
210 215 220 225 

Asp Gly Phe Arg lie Asp Ala Ala Lys His lie Lys Phe Ser Phe Leu 
230 235 240 

Arg Asp Trp Val Gin Ala Val Arg Gin Ala Thr Gly Lys Glu Met Phe 
245 250 255 

Thr Val Ala Glu Tyr Trp Gin Asn Asn Ala Gly Lys Leu Glu Asn Tyr 
260 265 270 

fg Leu Asn Lys Thr Ser Phe Asn Gin Ser Val Phe Asp Val Pro Leu His 
"~ 275 280 285 

5^ Phe Asn Leu Gin Ala Ala Ser Ser Gin Gly Gly Gly Tyr Asp Met Arg 

rU 290 295 300 305 
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Arg Leu Leu Asp Gly Thr Val Val Ser Arg His Pro Glu Lys Ala Val 
310 315 320 

Thr Phe Val Glu Asn His Asp Thr Gin Pro Gly Gin Ser Leu Glu Ser 
325 330 335 

Thr Val Gin Thr Trp Phe Lys Pro Leu Ala Tyr Ala Phe lie Leu Thr 
340 345 350 

Arg Glu Ser Gly Tyr Pro Gin Val Phe Tyr Gly Asp Met Tyr Gly Thr 
355 360 365 

Lys Gly Thr Ser Pro Lys Glu lie Pro Ser Leu Lys Asp Asn lie Glu 
370 375 380 385 

Pro lie Leu Lys Ala Arg Lys Glu Tyr Ala Tyr Gly Pro Gin His Asp 
390 395 400 

Tyr lie Asp His Pro Asp Val lie Gly Trp Thr Arg Glu Gly Asp Ser 
405 410 415 

Ser Ala Ala Lys Ser Gly Leu Ala Ala Leu lie Thr Asp Gly Pro Gly 
420 425 430 

Gly Ser Lys Arg Met Tyr Ala Gly Leu Lys Asn Ala Gly Glu Thr Trp 
435 440 445 

Tyr Asp lie Thr Gly Asn Arg Ser Asp Thr Val Lys lie Gly Ser Asp 
450 455 460 465 

Gly Trp Gly Glu Phe His Val Asn Asp Gly Ser Val Ser lie Tyr Val 
470 475 480 

Gin Lys 
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SJ..-UI 



(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1814 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 156.. 1802 

(ix) FEATURE: 

(A) NAME/KEY: sig_peptide 

(B) LOCATION: 156.. 257 

(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION: 258.. 1802 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 

AAATTCGATA TTGAAAACGA TTACAAATAA AAATTATAAT AGACGTAAAC GTTCGAGGGT 6 0 

TTGCTCCCTT TTTACTCTTT TTATGCAATC GTTTCCCTTA ATTTTTTGGA AGCCAAACCG 12 0 

TCGAATGTAA CATTTGATTA AGGGGGAAGG GCATT GTG CTA ACG TTT CAC CGC 173 

Val Leu Thr Phe His Arg 
-34 -30 

ATC ATT CGA AAA GGA TGG ATG TTC CTG CTC GCG TTT TTG CTC ACT GTC 221 
lie lie Arg Lys Gly Trp Met Phe Leu Leu Ala Phe Leu Leu Thr Val 
-25 -20 -15 

TCG CTG TTC TGC CCA ACA GGA CAG CCC GCC AAG GCT GCC GCA CCG TTT 26 9 

Ser Leu Phe Cys Pro Thr Gly Gin Pro Ala Lys Ala Ala Ala Pro Phe 
-10 -5 1 

AAC GGC ACC ATG ATG CAG TAT TTT GAA TGG TAC TTG CCG GAT GAT GGC 317 
Asn Gly Thr Met Met Gin Tyr Phe Glu Trp Tyr Leu Pro Asp Asp Gly 
5 10 15 20 

ACG TTA TGG ACC AAA GTG GCC AAT GAA GCC AAC AAC TTA TCC AGC CTT 365 
Thr Leu Trp Thr Lys Val Ala Asn Glu Ala Asn Asn Leu Ser Ser Leu 
25 30 35 

GGC ATC ACC GCT CTT TGG CTG CCG CCC GCT TAC AAA GGA ACA AGC CGC 413 
Gly lie Thr Ala Leu Trp Leu Pro Pro Ala Tyr Lys Gly Thr Ser Arg 
40 45 50 

AGC GAC GTA GGG TAC GGA GTA TAC GAC TTG TAT GAC CTC GGC GAA TTC 461 
Ser Asp Val Gly Tyr Gly Val Tyr Asp Leu Tyr Asp Leu Gly Glu Phe 
55 60 65 

AAT CAA AAA GGG ACC GTC CGC ACA AAA TAC GGA ACA AAA GCT CAA TAT 509 
Asn Gin Lys Gly Thr Val Arg Thr Lys Tyr Gly Thr Lys Ala Gin Tyr 
70 75 80 

CTT CAA GCC ATT CAA GCC GCC CAC GCC GCT GGA ATG CAA GTG TAC GCC 557 
Leu Gin Ala lie Gin Ala Ala His Ala Ala Gly Met Gin Val Tyr Ala 
85 90 95 100 

GAT GTC GTG TTC GAC CAT AAA GGC GGC GCT GAC GGC ACG GAA TGG GTG 6 05 

Asp Val Val Phe Asp His Lys Gly Gly Ala Asp Gly Thr Glu Trp Val 
105 110 115 




GAC GCC GTC GAA GTC AAT CCG TCC GAC CGC AAC CAA GAA ATC TCG GGC 6 53 

Asp Ala Val Glu Val Asn Pro Ser Asp Arg Asn Gin Glu lie Ser Gly 

120 125 130 

ACC TAT CAA ATC CAA GCA TGG ACG AAA TTT GAT TTT CCC GGG CGG GGC 7 01 

Thr Tyr Gin lie Gin Ala Trp Thr Lys Phe Asp Phe Pro Gly Arg Gly 
135 140 145 

AAC ACC TAC TCC AGC TTT AAG TGG CGC TGG TAC CAT TTT GAC GGC GTT 74 9 

Asn Thr Tyr Ser Ser Phe Lys Trp Arg Trp Tyr His Phe Asp Gly Val 
150 155 160 

GAT TGG GAC GAA AGC CGA AAA TTG AGC CGC ATT TAC AAA TTC CGC GGC 79 7 

Asp Trp Asp Glu Ser Arg Lys Leu Ser Arg lie Tyr Lys Phe Arg Gly 
165 170 175 180 

ATC GGC AAA GCG TGG GAT TGG GAA GTA GAC ACG GAA AAC GGA AAC TAT . 845 

lie Gly Lys Ala Trp Asp Trp Glu Val Asp Thr Glu Asn Gly Asn Tyr 
185 190 195 

GAC TAC TTA ATG TAT GCC GAC CTT GAT ATG GAT CAT CCC GAA GTC GTG 8 93 

Asp Tyr Leu Met Tyr Ala Asp Leu Asp Met Asp His Pro Glu Val Val 

200 205 210 

ACC GAG CTG AAA AAC TGG GGG AAA TGG TAT GTC AAC ACA ACG AAC ATT 941 

Thr Glu Leu Lys Asn Trp Gly Lys Trp Tyr Val Asn Thr Thr Asn lie 
215 220 225 

GAT GGG TTC CGG CTT GAT GCC GTC AAG CAT ATT AAG TTC AGT TTT TTT 98 9 

Asp Gly Phe Arg Leu Asp Ala Val Lys His lie Lys Phe Ser Phe Phe 
230 235 240 

CCT GAT TGG TTG TCG TAT GTG CGT TCT CAG ACT GGC AAG CCG CTA TTT 103 7 

Pro Asp Trp Leu Ser Tyr Val Arg Ser Gin Thr Gly Lys Pro Leu Phe 
245 250 255 260 

ACC GTC GGG GAA TAT TGG AGC TAT GAC ATC AAC AAG TTG CAC AAT TAC 108 5 

Thr Val Gly Glu Tyr Trp Ser Tyr Asp lie Asn Lys Leu His Asn Tyr 
265 270 275 

ATT ACG AAA ACA GAC GGA ACG ATG TCT TTG TTT GAT GCC CCG TTA CAC 113 3 

lie Thr Lys Thr Asp Gly Thr Met Ser Leu Phe Asp Ala Pro Leu His 

280 285 ' 290 

AAC AAA TTT TAT ACC GCT TCC AAA TCA GGG GGC GCA TTT GAT ATG CGC 1181 

Asn Lys Phe Tyr Thr Ala Ser Lys Ser Gly Gly Ala Phe Asp Met Arg 
295 300 305 

ACG TTA ATG ACC AAT ACT CTC ATG AAA GAT CAA CCG ACA TTG GCC GTC 122 9 

Thr Leu Met Thr Asn Thr Leu Met Lys Asp Gin Pro Thr Leu Ala Val 
310 315 320 

ACC TTC GTT GAT AAT CAT GAC ACC GAA CCC GGC CAA GCG CTG CAG TCA 12 7 7 

Thr Phe Val Asp Asn His Asp Thr Glu Pro Gly Gin Ala Leu Gin Ser 
325 330 335 340 

TGG GTC GAC CCA TGG TTC AAA CCG TTG GCT TAC GCC TTT ATT CTA ACT 13 2 5 

Trp Val Asp Pro Trp Phe Lys Pro Leu Ala Tyr Ala Phe lie Leu Thr 
345 350 355 

CGG CAG GAA GGA TAC CCG TGC GTC TTT TAT GGT GAC TAT TAT GGC ATT 13 73 

Arg Gin Glu Gly Tyr Pro Cys Val Phe Tyr Gly Asp Tyr Tyr Gly lie 

360 365 370 
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CCA CAA TAT AAC ATT CCT TCG CTG AAA AGC AAA ATC GAT CCG CTC CTC 1421 
Pro Gin Tyr Asn lie Pro Ser Leu Lys Ser Lys lie Asp Pro Leu Leu 
375 380 385 



ATC GCG CGC AGG GAT TAT GCT TAC GGA ACG CAA CAT GAT TAT CTT GAT 146 9 

lie Ala Arg Arg Asp Tyr Ala Tyr Gly Thr Gin His Asp Tyr Leu Asp 
390 395 400 

CAC TCC GAC ATC ATC GGG TGG ACA AGG GAA GGG GGC ACT GAA AAA CCA 1517 
His Ser Asp lie lie Gly Trp Thr Arg Glu Gly Gly Thr Glu Lys Pro 
405 410 415 420 

GGA TCC GGA CTG GCC GCA CTG ATC ACC GAT GGG CCG GGA GGA AGC AAA 1565 
Gly Ser Gly Leu Ala Ala Leu lie Thr Asp Gly Pro Gly Gly Ser Lys 
425 430 435 

TGG ATG TAC GTT GGC AAA CAA CAC GCT GGA AAA GTG TTC TAT GAC CTT 1613 
Trp Met Tyr Val Gly Lys Gin His Ala Gly Lys Val Phe Tyr Asp Leu 
440 445 450 

ACC GGC AAC CGG AGT GAC ACC GTC ACC ATC AAC AGT GAT GGA TGG GGG 1661 
Thr Gly Asn Arg Ser Asp Thr Val Thr lie Asn Ser Asp Gly Trp Gly 
455 460 465 

GAA TTC AAA GTC AAT GGC GGT TCG GTT TCG GTT TGG GTT CCT AGA AAA 17 09 

Glu Phe Lys Val Asn Gly Gly Ser Val Ser Val Trp Val Pro Arg Lys 
470 475 480 

ACG ACC GTT TCT ACC ATC GCT CGG CCG ATC ACA ACC CGA CCG TGG ACT 1757 
Thr Thr Val Ser Thr lie Ala Arg Pro lie Thr Thr Arg Pro Trp Thr 
485 490 495 500 

GGT GAA TTC GTC CGT TGG ACC GAA CCA CGG TTG GTG GCA TGG CCT 1802 
Gly Glu Phe Val Arg Trp Thr Glu Pro Arg Leu Val Ala Trp Pro 
505 510 515 

TGATGCCTGC GA 1814 

(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 9 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 

Val Leu Thr Phe His Arg lie lie Arg Lys Gly Trp Met Phe Leu Leu 
-34 -30 -25 -20 

Ala Phe Leu Leu Thr Val Ser Leu Phe Cys Pro Thr Gly Gin Pro Ala 
-15 -10 -5 

Lys Ala Ala Ala Pro Phe Asn Gly Thr Met Met Gin Tyr Phe Glu Trp 
15 10 

Tyr Leu Pro Asp Asp Gly Thr Leu Trp Thr Lys Val Ala Asn Glu Ala 
15 20 25 30 

Asn Asn Leu Ser Ser Leu Gly lie Thr Ala Leu Trp Leu Pro Pro Ala 
35 40 45 

Tyr Lys Gly Thr Ser Arg Ser Asp Val Gly Tyr Gly Val Tyr Asp Leu 
50 55 60 
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Tyr Asp Leu Gly Glu Phe Asn Gin Lys Gly Thr Val Arg Thr Lys Tyr 
65 70 75 

Gly Thr Lys Ala Gin Tyr Leu Gin Ala lie Gin Ala Ala His Ala Ala 
80 85 90 

Gly Met Gin Val Tyr Ala Asp Val Val Phe Asp His Lys Gly Gly Ala 
95 100 105 110 

Asp Gly Thr Glu Trp Val Asp Ala Val Glu Val Asn Pro Ser Asp Arg 
115 120 125 

Asn Gin Glu lie Ser Gly Thr Tyr Gin lie Gin Ala Trp Thr Lys Phe 
130 135 140 

Asp Phe Pro Gly Arg Gly Asn Thr Tyr Ser Ser Phe Lys Trp Arg Trp 
145 150 155 

Tyr His Phe Asp Gly Val Asp Trp Asp Glu Ser Arg Lys Leu Ser Arg 
160 165 170 

lie Tyr Lys Phe Arg Gly lie Gly Lys Ala Trp Asp Trp Glu Val Asp 
175 180 185 190 

r~5 Thr Glu Asn Gly Asn Tyr Asp Tyr Leu Met Tyr Ala Asp Leu Asp Met 

195 200 205 

\M Asp His Pro Glu Val Val Thr Glu Leu Lys Asn Trp Gly Lys Trp Tyr 

fU 210 215 220 

lL| Val Asn Thr Thr Asn lie Asp Gly Phe Arg Leu Asp Ala Val Lys His 

JlJ 225 230 235 

|y He Lys Phe Ser Phe Phe Pro Asp Trp Leu Ser Tyr Val Arg Ser Gin 

g" 240 245 250 

*J Thr Gly Lys Pro Leu Phe Thr Val Gly Glu Tyr Trp Ser Tyr Asp He 

CM 255 260 265 270 

jj : 

3*7; Asn Lys Leu His Asn Tyr He Thr Lys Thr Asp Gly Thr Met Ser Leu 

^ 275 280 285 

Phe Asp Ala Pro Leu His Asn Lys Phe Tyr Thr Ala Ser Lys Ser Gly 
290 295 300 

Gly Ala Phe Asp Met Arg Thr Leu Met Thr Asn Thr Leu Met Lys Asp 
305 310 315 

Gin Pro Thr Leu Ala Val Thr Phe Val Asp Asn His Asp Thr Glu Pro 
320 325 330 

, Gly Gin Ala Leu Gin Ser Trp Val Asp Pro Trp Phe Lys Pro Leu Ala 
335 340 345 350 

Tyr Ala Phe He Leu Thr Arg Gin Glu Gly Tyr Pro Cys Val Phe Tyr 
355 360 365 

Gly Asp Tyr Tyr Gly He Pro Gin Tyr Asn He Pro Ser Leu Lys Ser 
370 375 380 

Lys He Asp Pro Leu Leu He Ala Arg Arg Asp Tyr Ala Tyr Gly Thr 
385 390 395 

Gin His Asp Tyr Leu Asp His Ser Asp He He Gly Trp Thr Arg Glu 
400 405 410 
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Gly Gly Thr Glu Lys Pro Gly Ser Gly Leu Ala Ala Leu lie Thr Asp 
415 420 425 430 

Gly Pro Gly Gly Ser Lys Trp Met Tyr Val Gly Lys Gin His Ala Gly 
435 440 445 

Lys Val Phe Tyr Asp Leu Thr Gly Asn Arg Ser Asp Thr Val Thr lie 
450 455 460 

Asn Ser Asp Gly Trp Gly Glu Phe Lys Val Asn Gly Gly Ser Val Ser 
465 470 475 

Val Trp Val Pro Arg Lys Thr Thr Val Ser Thr lie Ala Arg Pro lie 
480 485 490 

Thr Thr Arg Pro Trp Thr Gly Glu Phe Val Arg Trp Thr Glu Pro Arg 
495 500 505 510 

Leu Val Ala Trp Pro 
515 

(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 
GGTCGTAGGC ACCGTAGCCC CAATCCGCTT G 31 

(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 
GGTCGTAGGC ACCGTAGCCC CAATCCCATT GGCTCG 36 

(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 
CTGTGACTGG TGAGTACTCA ACCAAGTC 2 8 
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(2) INFORMATION FOR SEQ ID NO: 10: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 78 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Ala Thr Pro Ala Asp Trp Arg Ser Gin Ser lie Tyr Phe Leu Leu Thr 
15 10 15 

Asp Arg Phe Ala Arg Thr Asp Gly Ser Thr Thr Ala Thr Cys Asn Thr 
20 25 30 

Ala Asp Gin Lys Tyr Cys Gly Gly Thr Trp Gin Gly lie lie Asp Lys 
35 40 45 

Leu Asp Tyr lie Gin Gly Met Gly Phe Thr Ala lie Trp lie Thr Pro 
50 55 60 

Val Thr Ala Gin Leu Pro Gin Thr Thr Ala Tyr Gly Asp Ala Tyr His 
65 70 75 80 

Gly Tyr Trp Gin Gin Asp lie Tyr Ser Leu Asn Glu Asn Tyr Gly Thr 
85 90 95 

Ala Asp Asp Leu Lys Ala Leu Ser Ser Ala Leu His Glu Arg Gly Met 
100 105 110 

Tyr Leu Met Val Asp Val Val Ala Asn His Met Gly Tyr Asp Gly Ala 
115 120 125 

Gly Ser Ser Val Asp Tyr Ser Val Phe Lys Pro Phe Ser Ser Gin Asp 
130 135 140 

Tyr Phe His Pro Phe Cys Phe lie Gin Asn Tyr Glu Asp Gin Thr Gin 
145 150 155 160 

Val Glu Asp Cys Trp Leu Gly Asp Asn Thr Val Ser Leu Pro Asp Leu 
165 170 175 

Asp Thr Thr Lys Asp Val Val Lys Asn Glu Trp Tyr Asp Trp Val Gly 
180 185 190 

Ser Leu Val Ser Asn Tyr Ser lie Asp Gly Leu Arg lie Asp Thr Val 
195 200 205 

Lys His Val Gin Lys Asp Phe Trp Pro Gly Tyr Asn Lys Ala Ala Gly 
210 215 220 

Val Tyr Cys lie Gly Glu Val Leu Asp Gly Asp Pro Ala Tyr Thr Cys 
225 230 235 240 

Pro Tyr Gin Asn Val Met Asp Gly Val Leu Asn Tyr Pro lie Tyr Tyr 
245 250 255 

Pro Leu Leu Asn Ala Phe Lys Ser Thr Ser Gly Ser Met Asp Asp Leu 
260 265 270 

Tyr Asn Met lie Asn Thr Val Lys Ser Asp Cys Pro Asp Ser Thr Leu 
275 280 285 
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Leu Gly Thr Phe Val Glu Asn His Asp Asn Pro Arg Phe Ala Ser Tyr 
290 295 300 

Thr Asn Asp lie Ala Leu Ala Lys Asn Val Ala Ala Phe lie lie Leu 
305 310 315 320 

Asn Asp Gly lie Pro lie lie Tyr Ala Gly Gin Glu Gin His Tyr Ala 
325 330 335 

Gly Gly Asn Asp Pro Ala Asn Arg Glu Ala Thr Trp Leu Ser Gly Tyr 
340 345 350 

Pro Thr Asp Ser Glu Leu Tyr Lys Leu lie Ala Ser Ala Asn Ala lie 
355 360 365 

Arg Asn Tyr Ala lie Ser Lys Asp Thr Gly Phe Val Thr Tyr Lys Asn 
370 375 380 

Trp Pro lie Tyr Lys Asp Asp lie Thr lie Ala Met Arg Lys Gly Thr 
385 390 395 400 

Asp Gly Ser Gin lie Val Thr lie Leu Ser Asn Lys Gly Ala Ser Gly 
405 410 415 

Asp Ser Tyr Thr Leu Ser Leu Ser Gly Ala Gly Tyr Thr Ala Gly Gin 
420 425 430 

Gin Leu Thr Glu Val lie Gly Cys Thr Thr Val Thr Val Gly Ser Asp 
435 440 445 

Gly Asn Val Pro Val Pro Met Ala Gly Gly Leu Pro Arg Val Leu Tyr 
450 455 460 

Pro Thr Glu Lys Leu Ala Gly Ser Lys lie Cys Ser Ser Ser 
465 470 475 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 58 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

<B) LOCATION: 1..1455 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:ll: 

CAT CAT AAT GGA ACA AAT GGT ACT ATG ATG CAA TAT TTC GAA TGG TAT 4 8 

His His Asn Gly Thr Asn Gly Thr Met Met Gin Tyr Phe Glu Trp Tyr 

520 525 530 

! 

TTG CCA AAT GAC GGG AAT CAT TGG AAC AGG TTG AGG GAT GAC GCA GCT 96 

Leu Pro Asn Asp Gly Asn His Trp Asn Arg Leu Arg Asp Asp Ala Ala 

535 540 545 

AAC TTA AAG AGT AAA GGG ATA ACA GCT GTA TGG ATC CCA CCT GCA TGG 144 
Asn Leu Lys Ser Lys Gly lie Thr Ala Val Trp lie Pro Pro Ala Trp 
550 555 560 

AAG GGG ACT TCC CAG AAT GAT GTA GGT TAT GGA GCC TAT GAT TTA TAT 192 
Lys Gly Thr Ser Gin Asn Asp Val Gly Tyr Gly Ala Tyr Asp Leu Tyr 
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565 570 575 

GAT CTT GGA GAG TTT AAC CAG AAG GGG ACG GTT CGT ACA AAA TAT GGA 24 0 

Asp Leu Gly Glu Phe Asn Gin Lys Gly Thr Val Arg Thr Lys Tyr Gly 
580 585 590 595 

ACA CGC AAC CAG CTA CAG GCT GCG GTG ACC TCT TTA AAA AAT AAC GGC 288 
Thr Arg Asn Gin Leu Gin Ala Ala Val Thr Ser Leu Lys Asn Asn Gly 
600 605 610 

ATT CAG GTA TAT GGT GAT GTC GTC ATG AAT CAT AAA GGT GGA GCA GAT 3 36 

lie Gin Val Tyr Gly Asp Val Val Met Asn His Lys Gly Gly Ala Asp 
615 620 625 

GGT ACG GAA ATT GTA AAT GCG GTA GAA GTG AAT CGG AGC AAC CGA AAC 3 84 

Gly Thr Glu lie Val Asn Ala Val Glu Val Asn Arg Ser Asn Arg Asn 
630 635 640 

CAG GAA ACC TCA GGA GAG TAT GCA ATA GAA GCG TGG ACA AAG TTT GAT 4 32 

Gin Glu Thr Ser Gly Glu Tyr Ala lie Glu Ala Trp Thr Lys Phe Asp 
645 650 655 

TTT CCT GGA AGA GGA AAT AAC CAT TCC AGC TTT AAG TGG CGC TGG TAT 480 
Phe Pro Gly Arg Gly Asn Asn His Ser Ser Phe Lys Trp Arg Trp Tyr 
660 ^ 665 670 675 

CAT TTT GAT GGG ACA GAT TGG GAT CAG TCA CGC CAG CTT CAA AAC AAA 52 8 

His Phe Asp Gly Thr Asp Trp Asp Gin Ser Arg Gin Leu Gin Asn Lys 
680 685 690 

ATA TAT AAA TTC AGG GGA ACA GGC AAG GCC TGG GAC TGG GAA GTC GAT 576 
lie Tyr Lys Phe Arg Gly Thr Gly Lys Ala Trp Asp Trp Glu Val Asp 
695 700 705 

ACA GAG AAT GGC AAC TAT GAC TAT CTT ATG TAT GCA GAC GTG GAT ATG 624 
Thr Glu Asn Gly Asn Tyr Asp Tyr Leu Met Tyr Ala Asp Val Asp Met 
710 715 720 

GAT CAC CCA GAA GTA ATA CAT GAA CTT AGA AAC TGG GGA GTG TGG TAT 6 72 

Asp His Pro Glu Val lie His Glu Leu Arg Asn Trp Gly Val Trp Tyr 
725 730 735 

ACG AAT ACA CTG AAC CTT GAT GGA TTT AGA ATA GAT GCA GTG AAA CAT 72 0 

Thr Asn Thr Leu Asn Leu Asp Gly Phe Arg lie Asp Ala Val Lys His 
740 745 750 755 

ATA AAA TAT AGC TTT . ACG AGA GAT TGG CTT ACA CAT GTG CGT AAC ACC 768 
lie Lys Tyr Ser Phe Thr Arg Asp Trp Leu Thr His Val Arg Asn Thr 
760 765 770 

ACA GGT AAA CCA ATG TTT GCA GTG GCT GAG TTT TGG AAA AAT GAC CTT 816 
Thr Gly Lys Pro Met Phe Ala Val Ala Glu Phe Trp Lys Asn Asp Leu 
775 780 785 

GGT GCA ATT GAA AAC TAT TTG AAT AAA ACA AGT TGG AAT CAC TCG GTG 864 
Gly Ala lie Glu Asn Tyr Leu Asn Lys Thr Ser Trp Asn His Ser Val 
790 795 800 

TTT GAT GTT CCT CTC CAC TAT AAT TTG TAC AAT GCA TCT AAT AGC GGT 912 
Phe Asp Val Pro Leu His Tyr Asn Leu Tyr Asn Ala Ser Asn Ser Gly 
805 810 ' 815 

GGT TAT TAT GAT ATG AGA AAT ATT TTA AAT GGT TCT GTG GTG CAA AAA 96 0 

Gly Tyr Tyr Asp Met Arg Asn lie Leu Asn Gly Ser Val Val Gin Lys 
820 825 830 835 

CAT CCA ACA CAT GCC GTT ACT TTT GTT GAT AAC CAT GAT TCT CAG CCC 100 8 
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His Pro Thr His Ala Val Thr Phe Val Asp Asn His Asp Ser Gin Pro 
840 845 850 

GGG GAA GCA TTG GAA TCC TTT GTT CAA CAA TGG TTT AAA CCA CTT GCA 10 56 

Gly Glu Ala Leu Glu Ser Phe Val Gin Gin Trp Phe Lys Pro Leu Ala 
855 860 865 

TAT GCA TTG GTT CTG ACA AGG GAA CAA GGT TAT CCT TCC GTA TTT TAT 1104 
Tyr Ala Leu Val Leu Thr Arg Glu Gin Gly Tyr Pro Ser Val Phe Tyr 
870 875 880 

GGG GAT TAC TAC GGT ATC CCA ACC CAT GGT GTT CCG GCT ATG AAA TCT 1152 
Gly Asp Tyr Tyr Gly lie Pro Thr His Gly Val Pro Ala Met Lys Ser 
885 890 895 

( 

AAA ATA GAC CCT CTT CTG CAG GCA CGT CAA ACT TTT GCC TAT GGT ACG 12 00 

Lys lie Asp Pro Leu Leu Gin Ala Arg Gin Thr Phe Ala Tyr Gly Thr 
900 905 910 915 

CAG CAT GAT TAC TTT GAT CAT CAT GAT ATT ATC GGT TGG ACA AGA GAG 124 8 

Gin His Asp Tyr Phe Asp His His Asp lie lie Gly Trp Thr Arg Glu 
920 925 930 

GGA AAT AGC TCC CAT CCA AAT TCA GGC CTT GCC ACC ATT ATG TCA GAT 12 96 

Gly Asn Ser Ser His Pro Asn Ser Gly Leu Ala Thr lie Met Ser Asp 
935 940 945 

GGT CCA GGT GGT AAC AAA TGG ATG TAT GTG GGG AAA AAT AAA GCG GGA 1344 
Gly Pro Gly Gly Asn Lys Trp Met Tyr Val Gly Lys Asn Lys Ala Gly 
950 955 960 

CAA GTT TGG AGA GAT ATT ACC GGA AAT AGG ACA GGC ACC GTC ACA ATT 13 92 

Gin Val Trp Arg Asp lie Thr Gly Asn Arg Thr Gly Thr Val Thr lie 
965 970 975 

AAT GCA GAC GGA TGG GGT AAT TTC TCT GTT AAT GGA GGG TCC GTT TCG 144 0 

Asn Ala Asp Gly Trp Gly Asn Phe Ser Val Asn Gly Gly Ser Val Ser 
980 985 990 995 

GTT TGG GTG AAG CAA TAA 14 58 

Val Trp Val Lys Gin 
1000 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 485 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 12 : 

His His Asn Gly Thr Asn Gly Thr Met Met Glh Tyr Phe Glu Trp Tyr 
1 5 10 15 

Leu Pro Asn Asp Gly Asn His Trp Asn Arg Leu Arg Asp Asp Ala Ala 
20 25 30 

Asn Leu Lys Ser Lys Gly lie Thr Ala Val Trp lie Pro Pro Ala Trp 
35 40 45 

Lys Gly Thr Ser Gin Asn Asp Val Gly Tyr Gly Ala Tyr Asp Leu Tyr 
50 55 60 
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Asp Leu Gly Glu Phe Asn Gin Lys Gly Thr Val Arg Thr Lys Tyr Gly 
65 70 75 80 

Thr Arg Asn Gin Leu Gin Ala Ala Val Thr Ser Leu Lys Asn Asn Gly 
85 90 95 

lie Gin Val Tyr Gly Asp Val Val Met Asn His Lys Gly Gly Ala Asp 
100 105 110 

Gly Thr Glu lie Val Asn Ala Val Glu Val Asn Arg Ser Asn Arg Asn 
115 120 125 

Gin Glu Thr Ser Gly Glu Tyr Ala lie Glu Ala Trp Thr Lys Phe Asp 
130 135 140 

Phe Pro Gly Arg Gly Asn Asn His Ser Ser Phe Lys Trp Arg Trp Tyr 
145 150 155 160 

His Phe Asp Gly Thr Asp Trp Asp Gin Ser Arg Gin Leu Gin Asn Lys 
165 170 175 

lie Tyr Lys Phe Arg Gly Thr Gly Lys Ala Trp Asp Trp Glu Val Asp 
180 185 190 

Thr Glu Asn Gly Asn Tyr Asp Tyr Leu Met Tyr Ala Asp Val Asp Met 
195 200 205 

Asp His Pro Glu Val lie His Glu Leu Arg Asn Trp Gly Val Trp Tyr 
210 215 220 

Thr Asn Thr Leu Asn Leu Asp Gly Phe Arg lie Asp Ala Val Lys His 
225 230 235 240 

lie Lys Tyr Ser Phe Thr Arg Asp Trp Leu Thr His Val Arg Asn Thr 
245 250 255 

Thr Gly Lys Pro Met Phe Ala Val Ala Glu Phe Trp Lys Asn Asp Leu 
260 265 270 

Gly Ala lie Glu Asn Tyr Leu Asn Lys Thr Ser Trp Asn His Ser Val 
275 280 285 

Phe Asp Val Pro Leu His Tyr Asn Leu Tyr Asn Ala Ser Asn Ser Gly 
290 295 300 

Gly Tyr Tyr Asp Met Arg Asn lie Leu Asn Gly Ser Val Val Gin Lys 
305 310 315 320 

His Pro Thr His Ala Val Thr Phe Val Asp Asn His Asp Ser Gin Pro 
325 330 335 

Gly Glu Ala Leu Glu Ser Phe Val Gin Gin Trp Phe Lys Pro Leu Ala 
340 345 350 

Tyr Ala Leu Val Leu Thr Arg Glu Gin Gly Tyr Pro Ser Val Phe Tyr 
355 360 365 

Gly Asp Tyr Tyr Gly lie Pro Thr His Gly Val Pro Ala Met Lys Ser 
370 375 380 

Lys lie Asp Pro Leu Leu Gin Ala Arg Gin Thr Phe Ala Tyr Gly Thr 
385 390 395 400 

Gin His Asp Tyr Phe Asp His His Asp lie lie Gly Trp Thr Arg Glu 
405 410 415 
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Gly Asn Ser Ser His Pro Asn Ser Gly Leu Ala Thr lie Met Ser Asp 
420 425 430 



Gly Pro Gly Gly Asn 
435 

Gin Val Trp Arg Asp 
450 

Asn Ala Asp Gly Trp 
465 



Lys Trp Met Tyr Val 
440 

lie Thr Gly Asn Arg 
455 

Gly Asn Phe Ser Val 
470 



Gly Lys Asn Lys Ala Gly 

445 

Thr Gly Thr Val Thr lie 
460 

Asn Gly Gly Ser Val Ser 
475 480 



Val Trp Val Lys Gin 
485 



(2) INFORMATION FOR SEQ ID NO : 13 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 83 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:13: 



Val 


Asn 


Gly 


Thr 


Leu 


Met 


Gin 


Tyr 


Phe 


Glu 


Trp 


Tyr 


Thr 


Pro 


Asn 


Asp 


1 








5 










10 










15 




Gly 


Gin 


His 


Trp 


Lys 


Arg 


Leu 


Gin 


Asn 


Asp 


Ala 


Glu 


His 


Leu 


Ser 


Asp 








20 










25 










30 






He 


Gly 


He 


Thr 


Ala 


Val 


Trp 


He 


Pro 


Pro 


Ala 


Tyr 


Lys 


Gly 


Leu 


Ser 






35 










40 










45 








Gin 


Ser 


Asp 


Asn 


Gly 


Tyr 


Gly 


Pro 


Tyr 


Asp 


Leu 


Tyr 


Asp 


Leu 


Gly 


Glu 




50 










55 










60 










Phe 


Gin 


Gin 


Lys 


Gly 


Thr 


Val 


Arg 


Thr 


Lys 


Tyr 


Gly 


Thr 


Lys 


Ser 


Glu 


65 










70 










75 










80 


Leu 


Gin 


Asp 


Ala 


He 


Gly 


Ser 


Leu 


His 


Ser 


Arg 


Asn 


Val 


Gin 


Val 


Tyr 
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90 










95 




Gly 


Asp 


Val 


Val 


Leu 


Asn 


His 


Lys 


Ala 


Gly 


Ala 


Asp 


Ala 


Thr 


Glu 


Asp 








100 










105 










110 






Val 


Thr 


Ala 


Val 


Glu 


Val 


Asn 


Pro 


Ala 


Asn 


Arg 


Asn 


Gin 


Glu 


Thr 


Ser 






115 










120 










125 








Glu 


Glu 


Tyr 


Gin 


He 


Lys 


Ala 


Trp 


Thr 


Asp 


Phe 


Arg 


Phe 


Pro 


Gly 


Arg 




130 










135 










140 










Gly 


Asn 


Thr 


Tyr 


Ser 


Asp 


Phe 


Lys 


Trp 


His 


Trp 


Tyr 


His 


Phe 


Asp 


Gly 


145 










150 










155 










160 


Ala 


Asp 


Trp 


Asp 


Glu 


Ser 


Arg 


Lys 


He 


Ser 


Arg 


He 


Phe 


Lys 


Phe 


Arg 










165 










170 










175 




Gly 


Glu 


Gly 


Lys 


Ala 


Trp 


Asp 


Trp 


Glu 


Val 


Ser 


Ser 


Glu 


Asn 


Gly 


Asn 








180 


Met 








185 










190 






Tyr 


Asp 


Tyr 


Leu 


Tyr 


Ala 


Asp 


Val 


Asp 


Tyr 


Asp 


His 


Pro 


Asp 


Val 






195 










200 










205 








Val 


Ala 


Glu 


Thr 


Lys 


Lys 


Trp 


Gly 


He 


Trp 


Tyr 


Ala 


Asn 


Glu 


Leu 


Ser 




210 










215 










220 










Leu 


Asp 


Gly 


Phe 


Arg 


He 


Asp 


Ala 


Ala 


Lys 


His 


He 


Lys 


Phe 


Ser 


Phe 


225 










230 










235 










240 


Leu 


Arg 


Asp 


Trp 


Val 


Gin 


Ala 


Val 


Arg 


Gin 


Ala 


Thr 


Gly 


Lys 


Glu 


Met 










245 










250 










255 




Phe 


Thr 


Val 


Ala 


Glu 


Tyr 


Trp 


Gin 


Asn 


Asn 


Ala 


Gly 


Lys 


Leu 


Glu 


Asn 








260 










265 










270 






Tyr 


Leu 


Asn 


Lys 


Thr 


Ser 


Phe 


Asn 


Gin 


Ser 


Val 


Phe 


Asp 


Val 


Pro 


Leu 






275 










280 










285 








His 


Phe 


Asn 


Leu 


Gin 


Ala 


Ala 


Ser 


Ser 


Gin 


Gly 


Gly 


Gly 


Tyr 


Asp 


Met 




290 










295 










300 










Arg 


Lys 


Leu 


Leu 


Asn 


Gly 


Thr 


Val 


Val 


Ser 


Lys 


His 


Pro 


Leu 


Lys 


Ser 


305 










310 










315 










320 
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Val Thr Phe Val Asp Asn His Asp Thr Gin Pro Gly Gin Ser Leu Glu 

325 330 335 

Ser Thr Val Gin Thr Trp Phe Lys Pro Leu Ala Tyr Ala Phe lie Leu 

340 345 350 

Thr Arg Glu Ser Gly Tyr Pro Gin Val Phe Tyr Gly Asp Met Tyr Gly 

355 360 365 

Thr Lys Gly Asp Ser Gin Arg Glu lie Pro Ala Leu Lys His Lys lie 

370 375 380 

Glu Pro lie Leu Lys Ala Arg Lys Gin Tyr Ala Tyr Gly Ala Gin His 
385 390 395 400 

Asp Tyr Phe Asp His His Asp lie Val Gly Trp Thr Arg Glu Gly Asp 

405 410 415 

Ser Ser Val Ala Asn Ser Gly Leu Ala Ala Leu lie Thr Asp Gly Pro 

420 425 430 

Gly Gly Ala Lys Arg Met Tyr Val Gly Arg Gin Asn Ala Gly Glu Thr 

435 440 445 

Trp His Asp lie Thr Gly Asn Arg Ser Glu Pro Val Val lie Asn Ser 

450 455 460 

Glu Gly Trp Gly Glu Phe His Val Asn Gly Gly Ser Val Ser lie Tyr 
465 470 475 480 

Val Gin Arg 
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